Overview

Dataset Statistics

Number of Variables 21
Number of Rows 41188
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 12
Duplicate Rows (%) 0.0%
Total Size in Memory 30.3 MB
Average Row Size in Memory 770.4 B
Variable Types
  • Numerical: 9
  • Categorical: 12

Dataset Insights

duration is skewed Skewed
campaign is skewed Skewed
pdays is skewed Skewed
emp.var.rate is skewed Skewed
cons.price.idx is skewed Skewed
cons.conf.idx is skewed Skewed
euribor3m is skewed Skewed
nr.employed is skewed Skewed
month has constant length 3 Constant Length
day_of_week has constant length 3 Constant Length
previous has constant length 1 Constant Length
emp.var.rate has 17191 (41.74%) negatives Negatives
cons.conf.idx has 41188 (100.0%) negatives Negatives
  • 1
  • 2

Variables


age

numerical

Approximate Distinct Count 78
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 659008
Mean 40.0241
Minimum 17
Maximum 98
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • age is skewed right (γ1 = 0.7847)

Quantile Statistics

Minimum 17
5-th Percentile 26
Q1 32
Median 38
Q3 47
95-th Percentile 58
Maximum 98
Range 81
IQR 15

Descriptive Statistics

Mean 40.0241
Standard Deviation 10.4212
Variance 108.6025
Sum 1.6485e+06
Skewness 0.7847
Kurtosis 0.7911
Coefficient of Variation 0.2604
  • age is not normally distributed (p-value 0.000968517201938162)
  • age has 469 outliers

job

categorical

Approximate Distinct Count 12
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3046068

Length

Mean 8.9552
Standard Deviation 2.1643
Median 10
Minimum 6
Maximum 13

Sample

1st row housemaid
2nd row services
3rd row services
4th row admin.
5th row services

Letter

Count 347751
Lowercase Letter 347751
Space Separator 0
Uppercase Letter 0
Dash Punctuation 10675
Decimal Number 0

marital

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 2958580
  • The largest value (married) is over 2.15 times larger than the second largest value (single)

Length

Mean 6.8311
Standard Deviation 0.6036
Median 7
Minimum 6
Maximum 8

Sample

1st row married
2nd row married
3rd row married
4th row married
5th row married

Letter

Count 281360
Lowercase Letter 281360
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (married, single) take over 50.0%
  • The largest value (married) is over 2.15 times larger than the second largest value (single)

education

categorical

Approximate Distinct Count 8
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3200759

Length

Mean 12.711
Standard Deviation 4.3889
Median 11
Minimum 7
Maximum 19

Sample

1st row basic.4y
2nd row high.school
3rd row high.school
4th row basic.6y
5th row high.school

Letter

Count 471587
Lowercase Letter 471587
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 12513
  • The top 2 categories (university.degree, high.school) take over 50.0%

default

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 2802584
  • The largest value (no) is over 3.79 times larger than the second largest value (unknown)

Length

Mean 3.0437
Standard Deviation 2.032
Median 2
Minimum 2
Maximum 7

Sample

1st row no
2nd row unknown
3rd row no
4th row no
5th row no

Letter

Count 125364
Lowercase Letter 125364
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (no, unknown) take over 50.0%
  • The largest value (unknown) is over 2865.67 times larger than the second largest value (yes)

housing

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 2786122

Length

Mean 2.644
Standard Deviation 0.8426
Median 3
Minimum 2
Maximum 7

Sample

1st row no
2nd row no
3rd row yes
4th row no
5th row no

Letter

Count 108902
Lowercase Letter 108902
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (yes, no) take over 50.0%
  • The largest value (yes) is over 21.79 times larger than the second largest value (unknown)

loan

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 2770794
  • The largest value (no) is over 5.43 times larger than the second largest value (yes)

Length

Mean 2.2719
Standard Deviation 0.8238
Median 2
Minimum 2
Maximum 7

Sample

1st row no
2nd row no
3rd row no
4th row no
5th row yes

Letter

Count 93574
Lowercase Letter 93574
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (no, yes) take over 50.0%
  • The largest value (yes) is over 6.31 times larger than the second largest value (unknown)

contact

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3021768
  • The largest value (cellular) is over 1.74 times larger than the second largest value (telephone)

Length

Mean 8.3653
Standard Deviation 0.4815
Median 8
Minimum 8
Maximum 9

Sample

1st row telephone
2nd row telephone
3rd row telephone
4th row telephone
5th row telephone

Letter

Count 344548
Lowercase Letter 344548
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (cellular, telephone) take over 50.0%
  • The largest value (cellular) is over 1.74 times larger than the second largest value (telephone)

month

categorical

Approximate Distinct Count 10
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 2800784
  • The largest value (may) is over 1.92 times larger than the second largest value (jul)

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row may
2nd row may
3rd row may
4th row may
5th row may

Letter

Count 123564
Lowercase Letter 123564
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (may, jul) take over 50.0%
  • The largest value (may) is over 1.92 times larger than the second largest value (jul)
  • month has words of constant length

day_of_week

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 2800784

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row mon
2nd row mon
3rd row mon
4th row mon
5th row mon

Letter

Count 123564
Lowercase Letter 123564
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • day_of_week has words of constant length

duration

numerical

Approximate Distinct Count 1544
Approximate Unique (%) 3.7%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 659008
Mean 258.285
Minimum 0
Maximum 4918
Zeros 4
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • duration is skewed right (γ1 = 3.263)

Quantile Statistics

Minimum 0
5-th Percentile 36
Q1 102
Median 180
Q3 319
95-th Percentile 752.65
Maximum 4918
Range 4918
IQR 217

Descriptive Statistics

Mean 258.285
Standard Deviation 259.2792
Variance 67225.7289
Sum 1.0638e+07
Skewness 3.263
Kurtosis 20.2453
Coefficient of Variation 1.0038
  • duration is not normally distributed (p-value 8.285504732825984e-15)
  • duration has 2963 outliers

campaign

numerical

Approximate Distinct Count 42
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 659008
Mean 2.5676
Minimum 1
Maximum 56
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • campaign is skewed right (γ1 = 4.7623)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 1
Median 2
Q3 3
95-th Percentile 7
Maximum 56
Range 55
IQR 2

Descriptive Statistics

Mean 2.5676
Standard Deviation 2.77
Variance 7.673
Sum 105754
Skewness 4.7623
Kurtosis 36.9752
Coefficient of Variation 1.0788
  • campaign is not normally distributed (p-value 4.478027682123767e-24)
  • campaign has 2406 outliers

pdays

numerical

Approximate Distinct Count 27
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 659008
Mean 962.4755
Minimum 0
Maximum 999
Zeros 15
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • pdays is skewed left (γ1 = -4.922)

Quantile Statistics

Minimum 0
5-th Percentile 999
Q1 999
Median 999
Q3 999
95-th Percentile 999
Maximum 999
Range 999
IQR 0

Descriptive Statistics

Mean 962.4755
Standard Deviation 186.9109
Variance 34935.6873
Sum 3.9642e+07
Skewness -4.922
Kurtosis 22.2266
Coefficient of Variation 0.1942
  • pdays is not normally distributed (p-value 4.570514341603313e-25)
  • pdays has 1515 outliers

previous

categorical

Approximate Distinct Count 8
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 2718408
  • The largest value (0) is over 7.8 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 41188
  • The top 2 categories (0, 1) take over 50.0%
  • The largest value (0) is over 7.8 times larger than the second largest value (1)
  • previous has words of constant length

poutcome

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 3107788
  • The largest value (nonexistent) is over 8.36 times larger than the second largest value (failure)

Length

Mean 10.4537
Standard Deviation 1.3736
Median 11
Minimum 7
Maximum 11

Sample

1st row nonexistent
2nd row nonexistent
3rd row nonexistent
4th row nonexistent
5th row nonexistent

Letter

Count 430568
Lowercase Letter 430568
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (nonexistent, failure) take over 50.0%
  • The largest value (nonexistent) is over 8.36 times larger than the second largest value (failure)

emp.var.rate

numerical

Approximate Distinct Count 10
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 659008
Mean 0.08189
Minimum -3.4
Maximum 1.4
Zeros 0
Zeros (%) 0.0%
Negatives 17191
Negatives (%) 41.7%
  • emp.var.rate is skewed left (γ1 = -0.7241)

Quantile Statistics

Minimum -3.4
5-th Percentile -2.9
Q1 -1.8
Median 1.1
Q3 1.4
95-th Percentile 1.4
Maximum 1.4
Range 4.8
IQR 3.2

Descriptive Statistics

Mean 0.08189
Standard Deviation 1.571
Variance 2.4679
Sum 3372.7
Skewness -0.7241
Kurtosis -1.0626
Coefficient of Variation 19.1848
  • emp.var.rate is not normally distributed (p-value 3.4375178154131496e-17)

cons.price.idx

numerical

Approximate Distinct Count 26
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 659008
Mean 93.5757
Minimum 92.201
Maximum 94.767
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • cons.price.idx is skewed left (γ1 = -0.2309)

Quantile Statistics

Minimum 92.201
5-th Percentile 92.713
Q1 93.075
Median 93.749
Q3 93.994
95-th Percentile 94.465
Maximum 94.767
Range 2.566
IQR 0.919

Descriptive Statistics

Mean 93.5757
Standard Deviation 0.5788
Variance 0.3351
Sum 3.8542e+06
Skewness -0.2309
Kurtosis -0.8299
Coefficient of Variation 0.006186
  • cons.price.idx is not normally distributed (p-value 1.1312223597493702e-09)

cons.conf.idx

numerical

Approximate Distinct Count 26
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 659008
Mean -40.5026
Minimum -50.8
Maximum -26.9
Zeros 0
Zeros (%) 0.0%
Negatives 41188
Negatives (%) 100.0%
  • cons.conf.idx is skewed right (γ1 = 0.3032)

Quantile Statistics

Minimum -50.8
5-th Percentile -47.1
Q1 -42.7
Median -41.8
Q3 -36.4
95-th Percentile -33.6
Maximum -26.9
Range 23.9
IQR 6.3

Descriptive Statistics

Mean -40.5026
Standard Deviation 4.6282
Variance 21.4202
Sum -1.6682e+06
Skewness 0.3032
Kurtosis -0.3587
Coefficient of Variation -0.1143
  • cons.conf.idx is not normally distributed (p-value 4.243974308846448e-15)
  • cons.conf.idx has 447 outliers

euribor3m

numerical

Approximate Distinct Count 316
Approximate Unique (%) 0.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 659008
Mean 3.6213
Minimum 0.634
Maximum 5.045
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • euribor3m is skewed left (γ1 = -0.7092)

Quantile Statistics

Minimum 0.634
5-th Percentile 0.797
Q1 1.344
Median 4.857
Q3 4.961
95-th Percentile 4.966
Maximum 5.045
Range 4.411
IQR 3.617

Descriptive Statistics

Mean 3.6213
Standard Deviation 1.7344
Variance 3.0083
Sum 149153.726
Skewness -0.7092
Kurtosis -1.4068
Coefficient of Variation 0.479
  • euribor3m is not normally distributed (p-value 7.991605856292984e-18)

nr.employed

numerical

Approximate Distinct Count 11
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 659008
Mean 5167.0359
Minimum 4963.6
Maximum 5228.1
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • nr.employed is skewed left (γ1 = -1.0442)

Quantile Statistics

Minimum 4963.6
5-th Percentile 5017.5
Q1 5099.1
Median 5191
Q3 5228.1
95-th Percentile 5228.1
Maximum 5228.1
Range 264.5
IQR 129

Descriptive Statistics

Mean 5167.0359
Standard Deviation 72.2515
Variance 5220.2833
Sum 2.1282e+08
Skewness -1.0442
Kurtosis -0.003906
Coefficient of Variation 0.01398
  • nr.employed is not normally distributed (p-value 1.873183502899342e-17)

y

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 2764236
  • The largest value (no) is over 7.88 times larger than the second largest value (yes)

Length

Mean 2.1127
Standard Deviation 0.3162
Median 2
Minimum 2
Maximum 3

Sample

1st row no
2nd row no
3rd row no
4th row no
5th row no

Letter

Count 87016
Lowercase Letter 87016
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (no, yes) take over 50.0%

Interactions

Correlations

Missing Values